Logical Unit / Data Product Overview

What is a Logical Unit?

A Logical Unit (LU or Logical Unit type - LUT), also known as a Data Product, is a blueprint data asset, engineered to deliver a trusted dataset for a specific business purpose (a Business Entity like customer, order, or loan). It holds a set of definitions and instructions used for integrating data from source systems, processing and governing the data, storing, and delivering it. The LU is the prototype from which LU Instances (LUIs) are created.

An LU is defined and configured in the Fabric Studio as a core element of the Fabric project. These definitions are comprised of 3 main types of objects:

  1. LU Table: The definition of a table within the LU Schema, with its columns, primary keys, indexes, and triggers.

  2. LU Table Population:

    • Data feeding into LU tables from a variety of data sources and keep it up to date.
    • Ability to manipulate the fed data, which includes enriching, cleansing, masking, transforming, etc.
  3. LU Schema: The relationship between the LU tables (similar to foreign keys). An LU schema has one LU table defined as its Root Table. The Root Table holds the LU’s unique key.

In addition to these main objects, there are some others that are a part of the logical unit, and they are used for defining its life cycle. They can be found in the Project Tree, under each logical unit:

Let’s use an example of a Customer 360 implementation for Company ABC:

  • LU / Data Product: Customer.
  • Data sources: CRM, Ordering, Billing and Collection, Billing and Assets.
  • LU tables: The tables that will hold the data you wish to keep about a customer from the 4 data sources.
  • LU Table Populations: The set of definitions that will be used for extracting, transforming, cleaning, aggregating, validating (etc.) the data from the 4 data sources into the LU tables.
  • LU schema: The definition of the Root Table and the relationship between all LU tables.

image

image

What Is a Logical Unit Instance (LUI)?

A Logical Unit Instance is one instance of a Logical Unit Type – it is a single physical database, which holds the data of one single Business Entity instance in the LUT structure definition. Using our example from above (Customer 360), assume that Company ABC has 35 million customers:

  • LU/LUT = Customer
  • LUI = one single customer database

Fabric will hold 35 million instances (LUIs) of the Customer LUT. That is, one physical database for each customer.

Things to Consider Before Designing an LU

Every Fabric project starts by defining its LUs. Analyze the business requirements and understand how the consuming application will use the data. Use this information to define the different Business Entities to implement and build an LU for each Business Entity.

General Recommendations for Designing an LU

Business Entity is often split between different data sources. In some cases, it is preferable to create one LU that contains all data sources. In other cases, it is more advantageous to split the LUs and create a separate LU for each data source.

In general, an LU should be based on the smallest number of data sources, as long as it represents a full view of a Data Product.

For example, if you have a Data Product called Customer, but different Customer Types (e.g. consumer and business) have different data sources, the recommended approach will be to create an LU for each subtype (in our example, different Customer Types).

Below is a pros and cons table of each alternative:

Item

LU per Business Entity

LU per Business Entity and data source

LU per Business Entity sub type

Ease of writing APIs

     

Replacing a data source

     

Small amount of data in LU

     

Maintenance, handling a less complex schema and internal relationships

     

Implementing a real-time action based on an event like a Golden Gate update, when the action depends on multiple data systems

     

Performance of real-time updates

     

Tuning the migration process

     

Note:

The file name's ambiguity is not supported within the same Logical Unit. This is not restricted by the Fabric Studio on purpose, allowing the implementor to continue the work and to update the names later. For example, if 2 Java function files with identical names were exported from other projects or libraries, they can be saved in the project in the Fabric Studio.

However, at run-time there should be no ambiguity within the LU, otherwise the server will run the first file it finds (with no commitment as to what is considered the first one).

Logical Unit / Data Product Overview

What is a Logical Unit?

A Logical Unit (LU or Logical Unit type - LUT), also known as a Data Product, is a blueprint data asset, engineered to deliver a trusted dataset for a specific business purpose (a Business Entity like customer, order, or loan). It holds a set of definitions and instructions used for integrating data from source systems, processing and governing the data, storing, and delivering it. The LU is the prototype from which LU Instances (LUIs) are created.

An LU is defined and configured in the Fabric Studio as a core element of the Fabric project. These definitions are comprised of 3 main types of objects:

  1. LU Table: The definition of a table within the LU Schema, with its columns, primary keys, indexes, and triggers.

  2. LU Table Population:

    • Data feeding into LU tables from a variety of data sources and keep it up to date.
    • Ability to manipulate the fed data, which includes enriching, cleansing, masking, transforming, etc.
  3. LU Schema: The relationship between the LU tables (similar to foreign keys). An LU schema has one LU table defined as its Root Table. The Root Table holds the LU’s unique key.

In addition to these main objects, there are some others that are a part of the logical unit, and they are used for defining its life cycle. They can be found in the Project Tree, under each logical unit:

Let’s use an example of a Customer 360 implementation for Company ABC:

  • LU / Data Product: Customer.
  • Data sources: CRM, Ordering, Billing and Collection, Billing and Assets.
  • LU tables: The tables that will hold the data you wish to keep about a customer from the 4 data sources.
  • LU Table Populations: The set of definitions that will be used for extracting, transforming, cleaning, aggregating, validating (etc.) the data from the 4 data sources into the LU tables.
  • LU schema: The definition of the Root Table and the relationship between all LU tables.

image

image

What Is a Logical Unit Instance (LUI)?

A Logical Unit Instance is one instance of a Logical Unit Type – it is a single physical database, which holds the data of one single Business Entity instance in the LUT structure definition. Using our example from above (Customer 360), assume that Company ABC has 35 million customers:

  • LU/LUT = Customer
  • LUI = one single customer database

Fabric will hold 35 million instances (LUIs) of the Customer LUT. That is, one physical database for each customer.

Things to Consider Before Designing an LU

Every Fabric project starts by defining its LUs. Analyze the business requirements and understand how the consuming application will use the data. Use this information to define the different Business Entities to implement and build an LU for each Business Entity.

General Recommendations for Designing an LU

Business Entity is often split between different data sources. In some cases, it is preferable to create one LU that contains all data sources. In other cases, it is more advantageous to split the LUs and create a separate LU for each data source.

In general, an LU should be based on the smallest number of data sources, as long as it represents a full view of a Data Product.

For example, if you have a Data Product called Customer, but different Customer Types (e.g. consumer and business) have different data sources, the recommended approach will be to create an LU for each subtype (in our example, different Customer Types).

Below is a pros and cons table of each alternative:

Item

LU per Business Entity

LU per Business Entity and data source

LU per Business Entity sub type

Ease of writing APIs

     

Replacing a data source

     

Small amount of data in LU

     

Maintenance, handling a less complex schema and internal relationships

     

Implementing a real-time action based on an event like a Golden Gate update, when the action depends on multiple data systems

     

Performance of real-time updates

     

Tuning the migration process

     

Note:

The file name's ambiguity is not supported within the same Logical Unit. This is not restricted by the Fabric Studio on purpose, allowing the implementor to continue the work and to update the names later. For example, if 2 Java function files with identical names were exported from other projects or libraries, they can be saved in the project in the Fabric Studio.

However, at run-time there should be no ambiguity within the LU, otherwise the server will run the first file it finds (with no commitment as to what is considered the first one).